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Locally Repairable Codes with Functional Repair 
and Multiple Erasure Tolerance 

Wentu Song and Chau Yuen 

Abstract 

We consider the problem of designing [n,k] linear codes for distributed storage systems (DSS) that satisfy the {r,t)-Local 
Repair Property, where any t'{< t) simultaneously failed nodes can be locally repaired, each with locality r. The parameters 
n,k,r,t are positive integers such that r < k < n and t < n — k. We consider the functional repair model and the sequential 
approach for repairing multiple failed nodes. By functional repair, we mean that the packet stored in each newcomer is not 
necessarily an exact copy of the lost data but a symbol that keep the (r,f)-local repair property. By the sequential approach, we 
mean that the t' newcomers are ordered in a proper sequence such that each newcomer can be repaired from the live nodes and 
the newcomers that are ordered before it. Such codes, which we refer to as {n, k, r, f)-functional locally repairable codes (FLRC), 
are the most general class of LRCs and contain several subclasses of LRCs reported in the literature. 

In this paper, we aim to optimize the storage overhead (equivalently, the code rate) of FLRCs. We derive a lower bound on 
the code length n given t € {2, 3} and any possible k, r. For t = 2, our bound generalizes the rate bound proved in 1141 . For 
t = 3, our bound improves the rate bound proved in Go). We also give some constructions of exact LRCs for t £ {2, 3} whose 
length n achieves the bound of {n, k, r, f)-FLRC, which proves the tightness of our bounds and also implies that there is no gap 
between the optimal code length of functional LRCs and exact LRCs for certain sets of parameters. Moreover, our constructions 
are over the binary field, hence are of interest in practice. 

Index Terms 

Distributed storage, locally repairable codes, exact repair, functional repair. 


I. Introduction 

A distributed storage system (DSS) stores data through a large, distributed network of storage nodes. To ensure reliability 
against node failure, data is stored in redundancy form so that it can be reconstructed from the system even if some of the 
storage nodes fail. Moreover, to maintain the data reliability in the presence of node failures, each failed node is replaced by 
a newcomer that stores a data packet computed from the data packets stored in some available storage nodes. This process is 
called node repair. 

There are two models of node repair, called exact repair and functional repair respectively. By exact repair, each newcomer 
stores an exact copy of the lost data packet. By functional repair, each newcomer stores a packet that is not necessarily an 
exact copy of the lost data, but a packet that makes the system keep the same level of data reliability and the possibility of 
node repair in the future. While exact repair is a special case of functional repair and is more preferable in practice for its 
simplicity, functional repair model has its theoretical interest because potentially it allows us to construct codes with improved 
code rate or minimum distance. 

Modern distributed storage systems employ various coding techniques, such as erasure codes, regenerating codes and locally 
repairable codes, to improve system efficiency. Classical MDS codes (such as Reed-Solomon codes) are optimal in storage 
efficiency but are inefficient in node repair—the total amount of data download needed to repair a single failed node equals 
to the size of the whole file HI. As improvements of MDS codes, regenerating codes aim to optimize the repair bandwidth 
im and locally repairable codes (LRC) aim to minimize the repair locality, i.e. the number of disk accesses required during a 
single node repair El. In this work, we focus on the metric of repair locality. 

Repair locality was initially studied as a metric for repair cost independently by Gopalan et al. 0, Oggier et al. 0, and 
Papailiopoulos et al. 0. The ith coordinate of an [n, k]q linear code C (also called the ith code symbol of C) is said to have 
locality r, if its value is computable from the values of a set of at most r other coordinates of C (called a repair set of i). In 
the literature, an [n, k] linear code is called a locally repairable code (LRC) if all of its code symbols have locality r for some 
r < fc. In a DSS coded by an LRC C, each storage node stores a code symbol of C and any single failed node can be “locally 
and exactly repaired” in the sense that the newcomer can recover the lost data by contacting at most r live nodes, where r is 
the symbol locality of C. 

A. Local Repair for Multiple Node Failures 

In real DSS, it is not uncommon that two or more storage nodes fail simultaneously at one time, which motivates the 
researchers to study LRCs that can locally repair more than one failed nodes. Studies of LRCs for multiple node failures can 
be found in 0-il5l and references therein. 
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To repair t [t >2) simultaneously failed nodes, t newcomers are added into the system, each downloads data from a set of at 
most r available nodes to create its storage content. The authors in lfT4l distinguished two approaches of how the t newcomers 
contact the available nodes, called parallel approach and sequential approach respectively. By the parallel approach, each 
newcomer download data from a set of live nodes. In contrast, by the sequential approach, the t newcomers can be properly 
ordered in a sequence and each newcomer can download data from both the live nodes and the newcomers ordered before it. 
Clearly, the parallel approach is a special case of the sequential approach. Potentially, the sequential approach allows us to 
design codes with improved code rate or minimum distance than the parallel approach. 

Given the parameters n, k, r and t, where n is the code length and k is the dimension, four subclasses of linear LRCs that can 
exactly and locally repair up to t failed nodes by the parallel approach are reported in the literature: a) Codes with all-symbol 
locality (r, f -f 1), in which each code symbol is contained in a local code of length at most r + t and minimum distance at 
least t+1 Q; b) Codes with all-symbol locality r and availability t, in which each code symbol has t pairwise disjoint repair 
sets with locality r 11, Elic) Codes with (r, f)-locality, in which each subset of t code symbols can be cooperatively repaired 
from at most r other code symbols d; d) Codes with overall local repair tolerance t, in which for any E C [n] of size t 
and any i G E, the ith code symbol has a repair set contained in [n]\i? and with locality r 1^ . 

For convenience, we refer to the above four subclasses of LRCs as (r, d)a codes, (r, 5)c codes, (r, f)-CLRC and (r, t)o codes 
respectively, where S = t + 1- Clearly, the first three subclasses are all contained in the subclass of {r,t)o codes. Moreover, 
{r,t)o codes can exactly and locally repair up to t failed nodes by the parallel approach. For {r,S)a codes and (r, f)-CLRC, 
the code rate satisfies (e.g., see M and ini): 


k r 

- < - 

n r + t 

and the minimum distance satisfies (see 13 and my- 


d<n — k+l — t 



( 1 ) 

( 2 ) 


For {r,S)c codes (i.e., codes with all-symbol locality r 


and availability t), it was proved in ifTol that the code rate satisfies: 


k ^ 1 


(3) 


and the minimum distance satisfies: 


t 

d < n — 

i=0 


k-1 

rjp'L 


(4) 


For t = 2, the bound (HI is shown to be achievable for some special case of parameters ifTOl . However, for the general case, 
it is not known whether the bounds Q and (|4li are achievable. Recent work by Wang et al. ifT^ shows that for any positive 
integers r and t, there exist {r,S)c codes over the binary field with code rate Unfortunately, the rate does not achieve 
the bound Q for t > 2. For the more general case, the {r,t)o codes, no result is known about the code rate bound or the 
minimum distance bound for t >2. 

For LRCs that can exactly and locally repair t = 2 failed nodes by the sequential approach, it was proved in lfT4l that the 
code rate satisfies: 



- < 


n r + 2 


(5) 


An upper bound for the minimum distance of such codes was also given in M- However, for f > 3, no result is known about 
the code rate bound or the minimum distance bound. 



Fig 1. Relation of the six subclasses of [n, k] linear LRCs, where <5 = f -|- 1. 
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B. LRC with Functional Repair 

Vector codes that can locally repair single failed node with functional repair model was considered by Hollmann et al. 
ESl-lIIsl. Suppose a is the capacity of each storage node and [3 is the transport capacity, i.e., the amount of data that can 
be transported from a node contacted during the repair process. It was proved in m that if a = /3, the code rate is upper 
bounded by where r is the repair locality. However, the study of LRC for multiple node failures under functional repair 
model is not seen in the literature. 


C. Our Contribution 


Given positive integers n, k, r and t such that r < k < n and t <n — k. We consider the problem of designing [n, k] linear 
codes for distributed storage systems (DSS) that satisfy the {r,t)-Local Repair Property, where any t'{< t) simultaneously 
failed nodes can be locally repaired, each with locality r. We consider the functional repair model and the sequential approach 
for repairing multiple failed nodes. By functional repair, we mean that the packet stored in each newcomer is not necessarily 
an exact copy of the lost data but a symbol that keep the (r, f)-local repair property. We call such codes (n, k, r, f)-functional 
locally repairable code (FLRC). A subclass of FLRC, called {n,k,r,t)-exact locally repairable code (ELRC), in which the 
(r, f)-local repair property is satisfied by exact repair, is also considered. 

Clearly, codes studied in HI are (n, k, r, 2)-ELRC and {r, t)o codes (i.e., codes with overall local repair tolerance t) studied 
in ||6l are (n, k, r, f)-ELRC. The relation of the six subclasses of LRCs mentioned above are depicted in Eig. [T] 

It is easy to see that the minimum distance of an (n, k, r, f)-ELRC is at least f + 1. In this paper, our goal is to optimize the 
storage overhead (equivalently, the code rate of such codes). When f = 1, by the result of m, the code rate of an (n, k, r, t)- 
ELRC is upper bounded by So we focus on the case of f > 2. Our method is to associate each (n, k, r, f)-ELRC with a 
set of directed acyclic graphs, called repair graph. Then by studying the structural properties of the so called minimal repair 
graph (similar to the discussion in lfT9l . 1201), we derive a lower bound of the code length n. Our main results are listed as 
bellow: 

1) We prove that for (n, k,r,t = 2)-ELRC, the code length satisfies 


n > fc + 



Equivalently, the code rate satisfies 

k r 

- < - . 

n r + 2 

Note that bound Q is an upper bound of the code rate of (n, k,r,t = 2)-ELRC. Thus, our bound generalizes the bound Q 
to the setting of functional repair model. 

2) We prove that for (n, k,r,t = 3)-ELRC, the code length satisfies 


n > k -\- 


2fc+r^i 

r 


Note that codes with all-symbol (r, (5 = 4)c-locality is an (n, k,r,t = 3)-ELRC. Eor f = 3, (O implies that n > 

— So our result improves the bound (O for f = 3. 


Moreover, we can check that k - 


zfe+r^i 




> 


3) We give some constructions of (n, fc, r, f)-ELRC for t G {2,3} whose code length n achieves the corresponding bound 
of ELRC, which proves the tightness of our bounds and also implies that there is no gap between the optimal code length of 
functional LRCs and exact LRCs for some sets of parameters. Moreover, our constructions are over the binary field, hence are 
of practical interest. 


D. Organization 

The rest of this paper is organized as follows. In Section II, we give the basic notations and concepts including functional 
locally repairable code (ELRC), exact locally repairable code (ELRC) and repair graph of ELRC. In section III, we prove some 
structural properties of the minimal repair graph of ELRC. Lower bounds on code length of (n, fc, r, f)-ELRC for t G {2,3} 
are derived in Section IV. Constructions of ELRC with optimal code length is presented in Section V. The paper is concluded 
in Section VI. 


IT Preliminary 

Eor any set A, we use |A| to denote the size (i.e., the number of elements) of A. A set B is called an r-subset of A if 
B <Z A and \B\ = r. Eor any positive integer n, we denote [n] := {1, 2, • • • , n}. An [n, k] linear code over a field F is a 
A:-dimensional subspace of F". 
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Let C be an [n, k] linear code over the field F. If there is no confusion in the context, we will omit the base field F and only say 
that C is an [n, k] linear code. A fc-subset S of [n] is called an information set of C if for all codeword x = {xi,X 2 , • • • ,Xn) S C 
and all i G [n], Xi = where all aij G F and are independent of x. The code symbols in {xj,j G S'} are called 

information symbol of C. In contrast, code symbols in {xi, i G [n]\S} are called parity symbol of C. An [n, k] linear code has 
at least one information set. 

For any E C [n], let E = and Cjs be the punctured code of C associated with the coordinate set E. That is, C\e is 

obtained from C by deleting all code symbols in the set {xi, i G E} for each codeword {xi,X 2 • • • , Xn) G C. 

A. Locally repairable code (LRC) 

In this subsection, we always assume that C is an [n, k] linear code over F. We first present the concept of repair set for 
each coordinate i G [n]. 

Definition 1: Let i G [n] and R C [n]\{i}. The subset R is called an {r,C)-repair set of i if |i?| < r and Xi = 
for all X = {xi,X2, ■ ■ ■ ,Xn) GC, where all aj G F and are independent of x. 

In the following, we will omit the prefix (r, C) and say that i? is a repair set of i if there is no confusion in the context. 

Definition 2: Let i? be a f-subset of [n]. C is said to be {E, r)-repairable if there exists an index of E, say E = {ii, ■ ■ ■ , it}, 
and a collection of subsets 

{Ri C EU{ii, ■ ■ ■ ,it-i}] \Rf.\ <rjG [f]} 
such that for each I G [f], Ri is an (r, C)-repair set of ii. 

In this paper, we assume r < k < n, which means small repair locality and at least one redundant code symbol. Moreover, 
if C is {E, ?’)-repairable for some f-subset E of [n], then we can easily see that t < n — k. 

Definition 3: Let C be an [n,k] linear code over F (not necessarily different from C) and E C [n], C is said to be an 
{E, r)-repair code of C if the following two conditions hold; 

(i) C|;g=C'|;g-; 

(ii) C is {E, r)-repairable. 

Consider a DSS with n storage nodes where a data file is stored as a codeword of C, each node storing one code symbol. 
Suppose the nodes indexed by E fail. Then the symbols stored in the live nodes form a codeword x-p^ of the punctured code 
C\-p. If C is an {E, r)-repair code of C, then x-^ is also a codeword of C'|-g. Moreover, since C is {E^ r)-repairable, then we 
can construct a codeword of C from x-g using the sequential approach, which form a process of functional repair. 

Definition 4: An {n,k,r,t)-functional locally repairable code (FLRC) is a collection of [n,k] linear codes {Ca;A G A}, 
where A is an index set, such that for each X G A and each E C [n] of size \E\ < t, there is a A' G A such that Cy is an 
(i?, r)-repair code of C\. 

Definition 5: An (n,k,r,t)-exact locally repairable code (ELRC) is an [n,k] linear code C such that for each E C [n] of 
size \E\ < t, C is r)-repairable. 

Clearly, for any DSS with n storage nodes and a data file of k information symbols being stored, if the (r, f)-local repair 
property is satisfied for functional repair model and the sequential approach, then the coding scheme can be described as an 
(n,/c, r, f)-FLRC. Conversely, any {n,k,r,t)-FLRC can be used as a coding scheme for such DSS. 

Let {Ca;A G A} be an (n, fc, r, f)-FLRC. Suppose i G [n] and Ai 7 ^ A 2 S A. It is possible that the (r, Cai (-repair set of i 
is different from the (r, Caj)- repair set of i. In other words, the repair set of the coordinate i is not fixed, but depends on the 
state of the system. 

From Definition [3 and 12 we can easily see that an [n, k] linear code C is an (n, k, r, f)-ELRC if and only if for all E C [n] 
of size \E\ < t, C is an (E, r)-repair code of itself. So an (n, fc, r, f)-ELRC is naturally an (n, fc, r, f)-ELRC. Moreover, we 
can characterize (n, fc, r, f)-ELRC by a seemingly simpler condition as follows. 

Lemma 6: An [n,k] linear code C is an (n, fc, r, f)-ELRC if and only if for any E C [n] of size \E\ < t, there exists an 
i G E such that i has an (r,C)-repair set contained in [n]\E;. 

Proof: If C is an (n, fc, r, f)-ELRC, then by Definition |2] and |5] there exists an index of E, say E = {L,-- - fit}, such 
that ii has an (r,C)-repair set Ri ^ E = 

Conversely, for any E C [n] of size \E\ =t'<t, by assumption, there exists an ii G E such that ii has an (r, C)-repair 
set Ri Q E = [n]\E. Now, let Ei = Then \Ei\ < t and by assumption, there exists an 12 G Ei such that 12 has 

an (r, C)-repair set R 2 C = E U {Lj. Similarly, we can find an 13 G E\{iifi 2 } such that is has an (r, C)-repair 

set Rs C E U {* 1 , 12 }. And so on. Then we can index E as E = {iifi 2 , ■ ■ ■ fit'} such that each ii has an (r, C)-repair set 
Re C E LI {iifi 2 , • , ie-i}- Thus, by Definition |2] and |5] C is an In, k, r, f)-ELRC. ■ 


B. Repair graph of LRC 

To derive a bound of the code length, we introduce the concepts of repair graph and minimal repair graph of an (n, k, r, f)- 
ELRC and investigate the structural properties of the minimal repair graphs. 
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Let G = (V, £) be a directed, acyclic graph with node (vertex) set V and edge (arc) set £. For any e = (u, v) £ £, we call 
u the tail of e and v the head of e. We also call u an in-neighbor of v and v an out-neighbor of u. For each v €V, let In(r!) 
and Out(r;) denote the set of in-neighbors and out-neighbors of v respectively. If In(u) = 0, we call v a source. Otherwise, 
we call V an inner node. We use S(G) to denote the set of all sources of G. Moreover, for any V CV, let 

Out{V) = y Out{v)\V. (6) 

vev 

And for any v G V, let 

Out^(u) = y Out(u)\Out(u) (7) 

7i£0ut('y) 

i.e., Out^(u) is the set of all w £ V such that w is an out-neighbor of some u £ Out(u) but not an out-neighbor of v. 

As an example, consider the graph as depicted in Fig. |2] We have Out(3) = {9,10} and Out(4) = {10,11}. So by (l6]l, 
Out(y) = {9,10,11}, where V = {3,4}. Moreover, by (|7]i, we have Out^(3) = {13,15,16}. 



Fig 2. An example repair graph where r = 2 and n = 16. 

For any linear code C with repair locality, we can associate C with a set of graphs called repair graph of C. 

Definition 7: Let C be an [n,/c] linear code and G = (V,f) be a directed, acyclic graph such that V = [n]. G is called a 
repair graph of C if for all inner node z £ V, In(t) is an (r, C)-repair set of i. 

A code C may have many repair graphs. Moreover, in Definition I?) we do not require that R = In(i) for any (r, C)-repair set 
R of i. Thus, it is possible that there exists an (r, C)-repair set R of i such that In(i) ^ R. However, we can always construct 
a repair graph G' of C such that In(i) = i? in G'. 

Definition 8: For any in, k, r, f)-FLRC {C\\ A £ A}, let 

5* ^min{|S(GA)|;A£ A,Ga £^a} (8) 

where Q\ is the set of all repair graphs of C^. If Aq £ A and Ga(, is a repair graph of Caq such that S* = |S(Gao)|, then we 
call Gao a minimal repair graph of {C\\ A £ A}. 

Remark 9: Note that for any (n, fc, r, f)-FLRC {Ca; A £ A}, {|S(Ga)|; A £ A, Ga £ G\} Q [n] is a finite set. So by ®, we 
can always find a Aq £ A and a repair graph Gaq of Caq such that <5* = |S(Gao)|- Thus, any (n, fc, r, f)-FLRC has at least one 
minimal repair graph. 


III. Properties of Minimal Repair Graph 

In this section, we investigate the properties of minimal repair graphs of (n, fc,r, f)-FLRC, which will be used to derive a 
lower bound on the code length n in the next section. Our discussions are summarized and illustrated in Fig. |3] 

In this section, we assume {Ca;A £ A} is an (n, fc, r, f)-FLRC and Gaq = (V,f) £ G\o is a minimal repair graph of 
{Ca; a £ A}, where Aq £ A. Note that the node set V = [n]. 

By Definition 0 and |7] Gaq has n — S* inner nodes and each inner node of Gaq has at most r in-neighbors. So we have 

(n — S*)r > |C|. ( 9 ) 

The following lemma shows that the dimension k is upper bounded by the number of sources of G\„. 

Lemma 10: For any (n, k, r, f)-FLRC {Ca; A £ A}, 

k<S* = \S{Gxo)\. (10) 

Proof: Consider an arbitrary A £ A and an arbitrary repair graph Ga of Ca. By Definition!?] G\ is acyclic and for each inner 
node j, In(j) is an (r,C)-repair set of j. Then by Definition [T] and by induction, for all codeword x = {xi,X 2 , ■ ■ ■ ,Xn) £ Ca 
and all j £ [n], the jth code symbol xj is an F-linear combination of the symbols in {xi;i £ S(Ga)}. So the set S(Ga) 
contains an information set of Ca, which implies that k < |S(Ga)|. Since A is an arbitrary element of A and G\ is an arbitrary 
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Fig 3. Relationship of discussions in Section III and IV. 

repair graph of C\, then by Definition 0 we have k < min{|S(GA)|; A G A,G\ G G\} = S* = |S(Gao)|, which proves the 
lemma. ■ 

The following lemma and its corollaries give some structural properties of G\g . 

Lemma 11: For any E C [n] of size \E\ = t' < t, 

|Out(F;)| > |£;ns(GA„)|. (ii) 

Proof: We can prove this lemma by contradiction. 

By Definition m there is a Ai G A such that Caj is an (i?, r)-repair code of Caq- By Definition |2l there exists an index of 
E, say E = {ii,i2, ■ ■ ■ , H'}, and a collection of subsets 

{Rt C EU {ii, • • • \Re\ <r,£ G [f']} 

such that Rg is an (r, CAi)-repair set of ig for each £ G [t']. We construct a repair graph Gaj of as follows: First, for each 
i G Ell Out(i?) and j G In(i), delete the edge (j, i); Then for each it G E and each j G Re, add a direct edge from j to 
Clearly, S(Gai) = {S{G\g)\E) UOut(iii). Here we fix the notation Out{E) to be defined in G\g. For each inner node i of 
Gai, we have the following two cases: 

Case 1: i G E. Then i = ii for some £ G \t'] and by the construction of Ga^, In(i) = Re is an (r, Cai )-repair set of i. 

Case 2: i is an inner node of G\g and i ^ Out(i5). Then In(i) G_ E = [n]\£i is an {r,C\g)-repair set of i. Moreover, since 
Cai is an {E,r)-repair code of C\g, then by condition (ii) of Definition [2 CaJ-^ = C\g\-^. So In(i) is also an (r, Cai)- repair 
set of i. 

Thus, for each inner node i of G\^, In(i) is an (r, Caj(- repair set of i. So Gaj is a repair graph of C\^. 

Now, suppose |Out(ii^)| < |i5 n S(Gao)|. Then we have 

|S(GAj| = |(S(GA„)\i?)UOut(i:;)| 

= |(S(GA„)\i?)| + |Out(i?)| 

= |(S(GA„)|-|i?nS(GA„)| + |Out(i?)| 

< |S(Ga„)| 

which contradicts to Definition^ Thus, by contradiction, we have |Out(i?)| > |i? fl S(Gao)|. ■ 

Example 12: Let G\g be as in Fig. |2]and Gaq be a repair graph of C\g with repair locality r = 2. By Definition |7] {2,3} is a 
(r, Cao(- repair set of 9, {3,4} is a repair set of 10, etc. Let Cai be an {E = {2,3,9}, r(-repair code of C\g such that the (r, Cai(- 
repair sets of 2, 3 and 9 are {1,10}, {12,13} and (11,14} respectively. As in the proof of Lemma fTTl we can construct a graph 
Gai as in Fig.|ll In G\g, we have Out(iiJ( = {10}. In Gaj, we have S(Gai( = (S(GAo(\i?(UOut(i?( = (S(Gao(\{ 2, 3}(U{10}. 
Moreover, we can check that Gai is a repair graph of Cai . In fact, note that by Definition [2 Cai = Cxg j-g-, where E = N\i?. 
Then {4, 5} is also an (r, CAi(-repair set of 11. Similarly, {6, 7} is an (r, CAi(-repair set of 12, etc. So|7] Gai is a repair graph 
of Cai . 

Corollary 13: Suppose t > 3. For any source v, the following hold: 

1) |Out(u)| > 1. 

2) If |Out(u(| = 1, then Out^(u( = Out(u'( 0, where v' is the unique out-neighbor of v. 
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Fig 4. The graph obtained from G\g by the process in the proof of Lemma fTTI for E = {2, 3, 9}, where G\g is depicted in Fig. [2]and the repair sets 
of 2, 3 and 9 are {1,10}, {12,13} and {11, 14} respectively. 


3) If Out(i;) = {ill} and Out(i;i) = {V 2 } for some inner nodes vi and V 2 , then Out(i; 2 ) 7 ^ 0. 

4) If Out(i;) = {ill} and Out(?;i) = {i; 2 } for some inner nodes vi and V 2 , then |Out(M)| > 2 for any source u that belongs 

to In(i; 2 ). 

5) If V and w are two different sources and |Out(u)| = |Out(i(;)| = 1, then the unique out-neighbor of v is different from 
the unique out-neighbor of w. 

Proof: We can prove all claims by contradiction. 

1) Suppose V has no out-neighbor. Picking E = {u}, then |Out(i?)| = |0| = 0 < |i? fl S(C?Ao)| = |{'u}| = 1, which 

contradicts to Lemma fTTI (e.g., see 1) of example fl4l} Thus, v must have at least one out-neighbor. 

2) Suppose Out(i;') = 0. Picking E = {v,v'}, then |Out(£i)| = |0| = 0 < fl S(G'aq)| = |{u}| = 1, which contradicts 
to Lemma fTTI (e.g., see 2 ) of example [14]) So it must be that Out(?;') 7 ^ 0. Since Gaq is acyclic and {u'} = Out(u), then 
v' G Out(?;'). By (|2ll, Out^(z;) = Out(i;') f- 0. 

3) Suppose Out(?; 2 ) = 0. Picking E = {v,vi,V 2 }, then |Out(ii^)| = |0| = 0 < |ii^nS(GAo)| = = 1’ which contradicts 

to Lemma fTTI (e.g., see 3) of example [441} So it must be that Out(u 2 ) 7 ^ 0. 

4) Suppose |Out(M)| < 2. Since u G In(?; 2 ), then Out(it) = {U 2 }. Picking E = {v,vi,u}, we have |Out(£l)| = |{u 2 }| = 
1 < n S(Ga(,)| = |{u,u}| = 2 , which contradicts to Lemma fTTI (See 4) of example fT4l} So it must be that |Out(M)| > 2 . 

5) Suppose Out(u) = Out(r(;) = {vi}. Picking E = {u,?!;}, we have |Out(£')| = |{ui}| = 1 < |iilnS(GAo)| = = 2, 

which contradicts to Lemma fTTI (e.g., see 5) of example [l4l} Thus, the out-neighbor of v and w must be different. ■ 

The following example illustrates the arguments in the proof of Corollary [O] 

Example 14: For the repair graph Gaq in Fig. |2| we have the following observations; 

1) Let u = 1. Note that Out(l) = 0. If we pick E = {1}, then we have |Out(£')| = |0| = 0 < |£'nS(GAo)| = |{1}| = 1- 

2) Letu = 2 and v' = 9. Note that Out(9) = 0. If we pick Li = {2, 9}, then |Out(£;)| = |0| = 0 < |L;nS(GAo)| = |{2}| = 1- 

3) Let u = 5,ui = 11 and V 2 = 13. Note that Out(13) = 0. If we pick E = {5,11,13}, then |Out(i?)| = |0| = 0 < 
|£;ns(GAj| = |{5}| = L 

4) Let V = 6,vi = 12, U 2 = 14 and u = 8. Note that |Out( 8 )| = 1. If we pick E = {6,8,12}, then |Out(i?)| = |{14}| = 
i<|L;ns(GA„)| = |{6,8}| = 2. 

5) Let u = 6 ,w = 7 and vi = 12. If we pick E = {6,7}, then |Out(L;)| = |{12}| = 1 < |L;nS(GAo)| = l{6,7}| = 2. 

Remark 15: In Corollary [13] 1) holds for all f > 1 and 2), 5) hold for all t > 2. In fact, in the proof of 1), contradiction is 

derived from a subset E of size 1. So the proof is valid for all f > 1. Hence, 1) holds for all f > 1. Similarly, checking the 
proof of 2) and 5), we can see that they hold for all t > 2. 



Fig 5. An example repair graph G, where n = 12 and r = 2. 


Corollary 16: Suppose v G S(Gao) and Out('u) = {ui,U 2 } for some inner nodes vi and V 2 - If f > 3, the following hold: 

1) Out(ui) 7 ^ 0 or Out(i; 2 ) 7 ^ 0. 

2) If vi = Out(M) for some source u, then Out(v 2 ) 7 ^ 0. 

3) If vi = Out(M) for some source it, then |Out(i(;)| > 2 for any source w that belongs to In(i! 2 ). 

Proof: We can prove all claims by contradiction. 

1) Suppose Out(iii) = 0 and Out(i; 2 ) = 0. Picking E = {u, Ui, U 2 }, we have |Out(iil)| = |0| = 0 < |i?nS(GAo)| = = 1’ 

which contradicts to Lemma [TT] (See 1 ) of example [T?]) So it must be that Out(iii) 7 ^ 0 or Out(i; 2 ) 7 ^ 0. 




2) Suppose Out(u 2 ) = 0. Picking E = {u, u, U 2 }, we have |Out(iiJ)| = |{ui}| = 1 < |£'nS(GAo)| = |{'y,'«}| = 2, which 
contradicts to Lemma [TT] (e.g., see 2) of example [TT]) So it must be that Out(u 2 ) ^ 0- 

3) Suppose w G In(v 2 ) is a source and |Out(r(;)| < 2. Then Out(?u) = {^ 2 }. Picking E = {u,v,w}, we have |Out(£^)| = 

|{ui,U 2 }| = 2 < |ii^nS(GA(,)| = |{u,t;,tt;}| = 3, which contradicts to Lemma [TT] (e.g., see 3) of example fTTl) So it must be 
that |Out(w)| >2. ■ 

The following example illustrates the arguments in the proof of Corollary [Thl 

Example 17: For the repair graph G in Fig. [Sj we have the following observations; 

1) Let u = 5,z;i = 9 and V 2 = 10. Note that Out(9) = Out(lO) = 0. If we pick E = {5,9,10}, then we have |Out(£’)| = 
|0| = o<|L;ns(G)| = |{5}| = i. 

2) Let u = 2,wi = 7,U 2 = 8 and it = 1. Note that Out(8) = 0. If we pick E = {1,2,8}, then |Out(£^)| = |{7}| = 1 < 

|£;ns(GAo)| = |{i,2}| = 2. 

3) Let u = 2,ui = 7,i ;2 = 8,It = 1 and w = 3. Note that |Out(3)| = 1. If we pick E = {1,2,3}, then |Out(i5)| = 
|{7,8}| = 2< |£;nS(GAj| = |{1,2,3}| = 3. 


IV. Bound of Code Length 

In this section, we will prove a lower bound on the code length n for (n, k, r, f)-FLRC with t G {2, 3}. 


A. Code Length for (n, fc, r, 2)-FLRC 

The following theorem gives a lower bound on the code length of (n, fc, r, 2)-FLRC. 
Theorem 18: For {n,k,r, 2)-FLRC, we have 


n > k + 



( 12 ) 


Proof: Suppose {Cy, A G A} is an (n, k, r, 2)-FLRC and Gaq = (V, £) is a minimal repair graph of {Ca; A G A}, where 
Aq £ A, V = [n] is the node set of Gy and S is the edge set of Gy. We first prove n > S* + where S* = |S(Gao)|. 

By Remark [ 15 ] and 1) of Corollary [T3| each source of Gaq has at least one out-neighbor. Let £red be the set of all edge e 
such that the tail of e is a source. We call each edge in £red a red edge. Let A be the set of all source v such that v has only one 
out-neighbor. Then the number of all red edges is |fi-ed| > |A|-|-2 (|S(Gao)\A|) = |A|-|-2(|S(Gao)| — |A|) = 2|S(Gao)| — |A| = 
28* — |A|. Thus, we have 

|f,ed| > 2^* - |A|. (13) 

For each v G A, since v has only one out-neighbor, by Remark [TSl and 2) of Corollary [T3] Out^(u) = Out(u ') ^ 0, 
where v' is the unique out-neighbor of v. Let £green{y) be the set of all edges whose tail is v'. Then fgreen(u) f 0. Let 
fgreen = Ut,6A ^green (u). We Call each edge in fgreen a green edge. For any two different ui,t ;2 G A, let v[,V 2 be the unique 
out-neighbor of t;i ,'!;2 respectively. By Remark [TSl and 5) of Corollary flj] f v'^. So we have fgreen(ui) nfgreen(u 2 ) = 0. 
Thus, the number of all green edges is |fgreen| = | U«gA ^green(u)| = I4reen(u)| > |A|, i.e., 

Ifgi-eenl > |A|. (14) 

Clearly, fred H fgreen = 0- Then by ([T3T l and (fTTl i. we have 


1^1 > l^red U fgi-eeni — l^redl + l^greenl > 28*. 


On the other hand, by (|9]l, we have 

(n — 8*)r > \£\. 

Thus, we have (n — 8*)r > 28*, which implies that nr > 8*(r -|- 2). So n > ^ = b* -f 

By Lemma [TOl k < 8* = |S(Gao)|- So n > 8* + ^ Moreover, since n is an positive integer, then we have 

n>k+ , which proves (IT^ . ■ 

In im, it was proved that the code rate of an (n, fc, r, 2)-ELRC satisfies bound Q. Note that ([T2]) also implies F ^ 7T2' 
So our result generalizes bound (|5]l to (n, k, r, 2)-FLRC. 
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B. Code Length for {n, fc, r, 3)-FLRC 

The following theorem gives a lower bound on the code length of (n, k, r, 3)-FLRC. 
Theorem 19: For (n, k, r, 3)-FLRC, we have 


n > k + 


2k+\7^ 

r 


(15) 


Before proving Theorem [19] we first prove the following Lemma |20| In the rest of this subsection, we always assume 
{Ca;A G A} is an (n, fc, r, 3)-FLRC and Gao = (V,f) is a minimal repair graph of {Ca;A G A}, where Aq G A, V = [n] is 
the node set of G\g and E is the edge set of Gaq- Then (5* = |S(Gaci)|, where 8* is defined by (| 8 ]l. 

Lemma 20: For (n, fc, r, 3)-FLRC, we have 


{n-6*)r > \£\ > 26* + 


(16) 


Proof: By (|9]l, we have (n — 5*)r > \E\, which proves the first inequality of (IThl l. So we only need to prove the second 
inequality of ( [T6b . To do this, we will divide the source set S(Gao) and the edge set E into mutually disjoint subsets. 

We can divide the source set S(Gao) into four subsets A, B, Ci and C 2 as follows: 


A = {v€S{Gx,);\Out{v)\>3}, 


(17) 


i7 = {uGS(GA„);|Out(u)| =2}, 


(18) 


Gi = {t; G S(Gao); |Out(u)| = 1 and |Out^(?;)| = 1} (19) 

and 

G 2 = {u G S(Gao); |Out(?;)| = 1 and \Out^{v)\ > 2}. (20) 

Clearly, A, B, Gi and G 2 are mutually disjoint. Moreover, by 1), 2) of Corollary [T3] S{G\g) = AU B U Ci U G 2 . Hence, 

5* = |S(Ga„)| = |A| + \B\ + |Gi| + IG 2 I. (21) 


We can divide the edge set E into three subsets as follows. 

Firstly, an edge is called a red edge if its tail is a source. For each v G S(Gao), let £^red('f) be the set of all red edges whose 
tail is V and Ered = Uj;es(GA )^red(t^) be the set of all red edges. Clearly, |^red(t^)| = |Out(z;)| and fred(^t') nfred(i') = 0 for 
any source w f v. So by ([l% — (l20l i. we have 

I f red I = E 10 ut(u)| >3|A|+2|S| + |Gi| + |G 2 |. (22) 

«es(GA„) 

Secondly, an edge is called a green edge if its tail is the unique out-neighbor of some source in Gi UG 2 . For each v G Gi UG 2 , 
let fgreen('f) be the set of all green edges whose tail is the unique out-neighbor of v and Egreen = Ui;eCiuG 2 ^green('t') be the 
set of all green edges. Note that by 2) of Corollary [T3l Out^(u) = Out('(;') f 0, where v' is the unique out-neighbor of v. 
Then |£’green(f)| = |Out^(u)|. Moreover, if G Gi U G 2 are different, then by 5) of Corollary [T3l their out-neighbors are 
different. So fgreenCt^) FI £^green(ti’) = 0- Hence, by ( [T9] l and (l20l i. we have 

|fgreen|= ^ jOut^ (u) | > |Gi | + 21G 2 1. (23) 

j;eGiUC2 


Thirdly, suppose u G B U Gi and e G £ such that e is neither a red edge nor a green edge. Then e is called a blue edge 
belonging to v if one of the following conditions hold: 

(a) V € B and the tail of e belongs to Out(?;). 

(b) V G Gi and the tail of e belongs to Out^(u). 

Let £biue(i’) denote the set of all blue edges belonging to v and fbiue = UjjeBuGi ^biue('y)- We have the following claim 1, 
whose proof is given in Appendix A. 

Claim 1: The number of blue edges is bounded by 


I f blue I > 


\B\ + \G,\ 


r 


(24) 
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Clearly, fred, ^green and fbiue are mutually disjoint. Then by (I2n i- (l24l i. we have 


\£\ > l^redl + I ^green I + l^bluel 
> (3|^| + 2\B\ + jCil + |C'2|) 
\B\ + \C,\ 


+ (|C'i| + 21(721) + ^ 

= 2(|7l| + |i?| + |Ci| + |C2|) 

+ (|7l| + |C2|+ '^'t’^'’ ) 


= 26* 


> 26* + 

6 * 

= 26* + —. 
r 


r\A\ + r\C 2 \ + \B\ + \Ci\ 
1^1 + \C2\ + \B\ + \Ci\ 


< 5 * 


, which proves the second inequality of (fThl l. 


Note that \£\ is an integer. Then we have \£\ > 26* 

By the above discussion, we proved (fThl l. which in turn proves Lemma [ 

To help the reader to understand the proof of Lemma |20] we give an example as follows. 

Example 21: Consider the graph in Fig. [h] Using the notations defined in the proof of Lemma l20l we have A = {2,4, 7}, 
B = {3,6}, Cl = {1} and C 2 = {5}. 

It is easy to find all red edges. We can also easily find that fgreen(l) = {(8,11)} and fgreenCS) = {(10,12), (10,13)}. 

Since 1 € (7i and 11 S Out^(l), then (11,14) G £biue(l); Since 11 G Out(6) and 6 G B, then (11,14) G £biue(6); Since 3 € B 
and 9 G Out(3), then (9, 11) G fbiue(3). We can further check that £^biue(l) = fbiue(6) = {(11,14)} and £^biue(3) = {(9,11)}. 



Fig 6. An example of partitioning the edge set of minimal repair graph: The red (resp. green, blue) edges are colored by red (resp. green, blue). 


Now, using Lemma |20] and Lemma [TO] we can give a simple proof of Theorem [T9] 
Proof of Theorem M By Lemma |20l we have 


{n-6*)r > \£\ > 26* + 


5^ 

r 


So 


(n - 6*)r > 26* + 


6^ 

r 


Solving n from the above equation, we can obtain 


By Lemma [TOl we have 6* > k. So 


From (125]) and (]2^ . we have 


26*+ ^ 

n>6* + - 

r 



> k + 


2fc+m 

r 


n > k 


r 


(25) 


( 26 ) 
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Fig 7. Comparison of the code length bounds for t = r = 3 and S = t 1 = A. 


Since n is a positive integer, then we have n > k 


2fc+r^i 


, which proves Theorem [19] 


We next show that the bound (ITSl l improves the bound Q for codes with all-symbol (r, 4)c-locality. Note that for such 
codes, the bound © is equivalent to 

r-|-12r + 13r + l. 


n > 


-k. 


r 2r 3r 

Also note that codes with all-symbol (r, 4)c-locality are (n, fc, r, 3)-ELRC. Then by (ITSl l. we have 


n > k + 




>k + 


2fc+rii 


(27) 


(28) 


It is easy to check that 


2fc+r|l\ fr + 12r + 13r + l. 


_ 1 
r 

> 0 . 


k\ k 

6r 


1 - 


2r 3r 
1 


So (l28l l is an improvement of dZTl) . 

An illustration of the gap between the bounds (ffSl) and Q for the parameters f = r = 3 is given in Fig. [7] from which we 
can see that dfSl) is tighter than (O for f = 3. 


V. Code construction 

In this section, we give some constructions of (n, fc, r, 2)-ELRCs and (n, k, r, 3)-ELRCs whose length n achieve the bounds 
dTSli and (ITSl l respectively. We call such codes optimal (n, k, r, 2)-ELRC and optimal (n, k, r, 3)-ELRC respectively. By these 
constructions, we prove the tightness of the bound ([TtI) and (flSl) . Moreover interestingly, our results show that for some sets of 
parameters, exact LRCs is sufficient to achieve the optimal code length of functional LRCs. Our discussions are summarized 
and illustrated in Fig. 0 

We begin with a lemma that gives a method to construct subsets of [n] that can be used to construct repair set for LRC. 
Lemma 22: Let C = {Ci, • • • be a collection of pairwise disjoint subsets of [n] and (ri,r 2 , • • • ^tk) be a AT-tuple 

of positive integers such that \^i\ = Sti Let M he a K x N binary matrix such that for each i G [K] and each 

j G [W], the sum of the zth row is and the sum of the jth column is \Cj\. Then there exists a collection {Bi, - ■ ■ , Bk} of 
subsets of U^i that; 

(i) Bi, - ■ ■ , Bk are pairwise disjoint and IJfci — U^i 

(ii) \Bi\ = Ti for all i G [K]; 

(iii) jSi n Cjj < 1 for all i G [K] and j G [Wj. 

Proof: For each j G [W], since the sum of the jth column of M is \Cj\, we can replace the ones of the jth column by 
elements of Cj such that each element of Cj appears exactly once. Denote the resulted matrix by M'. Now for each i G [K\, 
let Bi be the elements of the ith row of M' except the zeros. 
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Fig 8. Relationship of discussions in Section V. 


Since Ci, • • • , Cn are pairwise disjoint and for each j S [N], each element of Cj appears exactly once in the jth column 
of M', then each element of U^i appears exactly once in M', which implies conditions (i) and (iii). Moreover, since the 
sum of the ith row of M is then \Bi\ = for all i G [K], So condition (ii) is satisfied. ■ 

We give an example in the below to demonstrate the construction method used in the proof Lemma l22l 
Example 23: Let Ci = {1,2,3,4, 5}, C 2 = {6,7,8,9,10}, C 3 = {11,12,13,14,15}, C 4 = {16,17,18,19,20}, Cg = 


{22,23,24,25}, Cq = {27,28,29,30} and Cr = {31,32,33}. Let ri = 
iZi=i \^i \ = 31 = X]i=i Let 


= rg = 5 and = 3. Then we have 


M : 


( 1 
0 
1 
1 
1 
1 
0 


\ 


We can check that for each i, j € {1, 2, • • • , 7}, the sum of the ith row is and the sum of the jth column is \Cj\. Replacing 
the ones of the jth column of M by elements of Cj, we obtain 


M' = 


From M', we can obtain subsets Bi = {1,6,11,16,22}, B 2 = {7,12,17,27,31}, B^ = {2,8,13,18,23}, B 4 = 
{3,14,19, 28, 32}, Bg = {4, 9, 24, 29, 33}, Bq = {5,10,15} and B^ = {20, 25, 30}. It is easy to check that conditions 
(i)—(iii) of Lemma l22l are satisfied. 

Corollary 24: Let C = {Ci, • ■ • , Cm} be a collection of pairwise disjoint ^-subsets of [n] and r = (ri, • • • , ) be a 

iT-tuple of positive integers such that < l-Cj = for all i G \K\. Then there exists a collection 


/1 

6 

11 

16 

22 

0 

0 

\ 

0 

7 

12 

17 

0 

27 

31 


2 

8 

13 

18 

23 

0 

0 


3 

0 

14 

19 

0 

28 

32 


4 

9 

0 

0 

24 

29 

33 


5 

10 

15 

0 

0 

0 

0 


V 0 

0 

0 

20 

25 

30 

0 

/ 




,Bk} of subsets of Cj such that; 


(i) i?i, • ■ • , Bk are pairwise disjoint and Bi = Cj; 


\N 


(ii) \Bi \ = n for all i G [K\, 

(iii) \BiCCj\ < 1 for all i G [K\ and j G [A^]. 

Proof: Since ^^d Ti < N for all i G {K\, using the Gale-Ryser Theorem (see Manfred liSTl ). we can 

construct a K x N binary matrix M such that for each i G \K] and each j G [A^], the sum of the ith row of M is and the 
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sum of the jth column of M is S = \Cj\. By Lemma l22l there exists a collection {Bi, • • • , Bk} of subsets of U^i ^3 '^^at 
satisfies the conditions (i)—(iii). ■ 

The following two lemmas give a sufficient condition of (n, k, r, 2)-ELRC and (n, k, r, 3)-ELRC respectively. 

Lemma 25: Let C be an [n, k] linear code and [n] = S' U T such that S fl T = 0. Then C is an (n, k, r, 2)-ELRC if the 
following two conditions hold; 

(i) Each j G S has two disjoint (r, C)-repair sets; 

(ii) Each z G T has an (r,C)-repair set RC S. 

Proof: We will prove that for any E C [n] of size \E\ < 2, there exists an i C E such that i has an (r,C)-repair set 
R C We have the following two cases: 

Case 1: E' n S = 0. Then E <LT and by condition (ii) each i G E has an (r,C)-repair set i? C S C [n]\E. 

Case 2; E n S 0. Suppose z G E Cl S. By condition (i), i has two disjoint (r, C)-repair sets, say Ri and i?2. Note that 
|E| < 2 and z ^ i?i U R 2 , then either E n i?i = 0 or E n i?2 = 0. Without loss of generality, assume E n i?i =0. Then we 
have i?i C [n]\E. 

Thus, we can always find an z G E that has an (r, C)-repair set R C [n]\E. By Lemma |6] C is an (n, fc, r, 2)-ELRC. ■ 

Lemma 26: Let C be an [rz, k] linear code and [n] = S' U T such that S fl T = 0. Then C is an (n, k, r, 3)-ELRC if the 

following two conditions hold; 

(i) Each z G S has two disjoint (r, C)-repair sets, say i?i and i?2, such that each j G Ri has an (r, C)-repair set i?n(i?2U{z}) = 
0; 

(ii) Each z G T has an (r,C)-repair set E C S; 

Proof: Eor any E C [n] of size |E| < 3, similar to the proof of Lemma l25l we have the following two cases: 

Case 1: E fl S = 0. Then E C T and by condition (ii) each z G E has an (r, C)-repair set E C S C [n]\E. 

Case 2: E n S 7^ 0. Let z G E fl S. By condition (i), z has two disjoint (r, C)-repair sets, say Ei and E2, such that each 

j G El has an (r, C)-repair set E fl (E2 U {z}) = 0. Then we have the following two subcases; 

Case 2.1: E fl Ei = 0 or E n E2 = 0. If E n Ei = 0, then Ei C [n]\E; If E fl E2 = 0, then E2 Q So in this 

subcase, z has an (r,C)-repair set contained in [n]\E. 

Case 2.2; E fl Ei 7^ 0 and E n E2 7^ 0. Assume j G E fl Ei and j' G E fl E2. Then by condition (i), j has an (r, C)-repair 
set E n (E2 U {z}) = 0. So 

En(E2U{z,j}) = 0. (29) 

On the other hand, since Ei fl E2 = 0 and |E| < 3, then j 7^ j' and 

= {l j,/} C E2 U {z,j}. (30) 

Combining ( l30t and ( 1291 ). we have E C [n]\E. So in this subcase, j G E has an (r,C)-repair set E C [n]\E. 

Thus, we can find an element of E that has an (r, C)-repair set E C [n]\E. By Lemma |6] C is an (n, k, r, 3)-ELRC. ■ 


A. Optimal {n,k,r,2)-ELRC 

In this subsection, we give a method for constructing [n = k + , fc, r, 2)-ELRC. Our construction is based on the 

following lemma. 

Lemma 27: Suppose Lf J > - There exists a collection A = {Ai, • • • , of zy = \^~\ subsets of [k] such that: 

(i) I Ail < r for each z G [zy]; 

(ii) lAi n Ajj < 1 for all {z, j} C [zy]; 

(iii) Each z G [/c] belongs to exactly two subsets in A; 

Proof: The proof is given in Appendix B. ■ 

The following are two examples of subsets that satisfy conditions (i)—(iii) of Lemmas |27l 

Example 28: Eor A: = 12 and r = 3, we have zy = \^~\ = 8. Let A = {Ai, • • • , Ag} be as in Eig. |9ja), where each subset 
in {Ai, • • • , A 4 } is represented by a red line and each subset in {A 5 , • • • , Ag} is represented by a blue line. We can check 
that conditions (i)—(iii) of Lemmas |22] are satisfied. 

Example 29: Eor k = If) and r = 3, we have zy = \^~\ = 7. Let A = {Ai, • • • , A7} be as in Eig. |9}b), where each subset 
in {Ai, A2, Ag} is represented by a red solid line, A4 is represented by a red dashed line and each subset in {Ag, Ag, A7} is 
represented by a blue line. We can check that conditions (i)—(iii) of Lemmas are satisfied. 

Now we have the following construction. 

Construction 1: Let LIJ> r and A = {Ai, • • • , A,;} be constructed as in Lemma l27l where zy = . Let xi, - ■ ■ , Xfe be 

k information symbols. Then we can construct a [k + rj, k] systematic linear code C over F2 with zy parities Xfc+i, • • • , Xk+rj 
such that Xk+i = ^3 z G [zy]. 

Theorem 30: The code C obtained by Construction \ is an [n = k + , fc, r, 2)-ELRC. 
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Fig 9. Subsets of [k] that satisfy conditions of Lemmas |27l (a) is for k = 12 and (b) is for k = 10. 


Proof: Let S = [fc] and T = {fc + 1, • • • ,k + r]}, where r] = . Then we have S CiT = 0. By conditions (ii), (iii) 

of Lemma l27l for each i € S, there exist two subsets, say Ai^ and Ai^, such that Ai^ n Ai^ = {i}. By Construction 1 and 
condition (i) of Lemma l27l Ri = Ai^ U {fc + *i}\{*} and i?2 = Ai^ U {fe + i2}\{*} are two disjoint (r, C)-repair sets of i. 
Moreover, for each i € T, again by Construction 1 and condition (i) of Lemma |22l Ai-k is an {r, C)-repair set of i. So by 
Lemma 122] C is an (n, fc, r, 2)-ELRC. ■ 

Note that the code C obtained by Construction 1 has length n = k + T] = k+ which meets the bound (fTSl i. So from 
Theorem [30| we can directly obtain the following theorem. 

Theorem 31: If LIJ > then there exist (n, fc, r, 2)-ELRC over the binary field that meet the bound (fT2l) . 

The authors in ca constructed binary codes with all-symbol locality r, availability t and code rate for n = *) 

and any positive integer r and t (such codes are a subclass of (n, fc, r, f)-ELRC). Eor f = 2, we have n = 
fc = = ifchil In our construction, we require that > r, which implies that k>r^> if r > 1. 


B. Optimal {n,k,r,3)-ELRC 

In this subsection, we give a method for constructing {n = k ■ 


2fc-i-rli 


, fc, r, 3)-ELRC. We always denote 


and 


Then we have 


'2fc+r7i’ 


■fc' 


2k + m 

r 


r 


r 


— m. 


= fc- 


2k+\^^ 


= k + m + i. 


Our construction is closely related to the following concept. 

Definition 32: A mesh of [n] is a collection TZUB of subsets of [n], where TZ = {i?Li, • • ■ , RLm} and B = {BLi, • • • , BLi} 
are called red lines and blue lines respectively, that satisfies the following conditions: 

(i) Eor each i G [m], RLi C [fc -|- m], \RLi\ = r + I and RLi n {fc -f 1, • • • , fc -f m} = {k + i}; 

(ii) Eor each j G \E\, BLj n {fc -f m -f 1, • • • ,n} = {k + m + j} and \BLj\ < r + V, 

(iii) Each i G [k + m] belongs to exactly two lines, at least one is a red line; 

(iv) Any two different lines have at most one point in common; 

(v) Any two different lines that intersect with the same red line are disjoint. 

Here a line means a subset in 72. U 6 (i.e., a red line or a blue line) and a point means an element of [nj. 


“ rr 1 ~ ® n = fc-|-m-|-.f = 22. Let 


Example 33: Eor fc = 12 and r = 3, we have m = {^] = 4, £ = 

TZ = {RLi, • • ■ , RL 4 } be the red lines and {Bi, • • ■ , Bq} be the blue lines in Eig. fTOl al. Then extend each Bi to a blue line 
BLi as in Eig. [To{ b). Let B = {BLi^ • • • , BLq}. We can check that 72 U S is a mesh of [nj. 

Example 34: Eor fc = 16 and r = 3, we have m = 6 , i = 7 and n = 29. Let 72 = {7?Li, • • ■ , RL^, RLq}, where 
RLi, ■ ■ ■ ,RL^ are the red solid lines in Eig. [TIT a) and Rq is the red dashed line in Eig. [Ufa). We partition the first three 
columns into Bi = {2, 6,9}, B 2 = {18,19}, B3 = {17}, B 4 = {3}, B 5 = {8}, Bq = { 1 , 5} and B 7 = {4, 7}. In Eig. [TT}a). 
each Bi of size \Bi\ > 2 is represented by a blue line and the other points of the first three columns represent the BiS of size 
1. Eurther, we extend each Bi to a blue line BLi as in Eig. fTTI bL Let B = {BLi, • • • , BLq). Then we can check that 72 U S 
is a mesh of [n]. 

The following two lemmas and their proofs give some constructions of mesh of [n]. 

Lemma 35: If rjfc and m > r, there exists a mesh of [n]. 
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Fig 11. Constmction of a mesh of [n], where k = 16, r = 3 and n = 29. 

Proof: The proof is given in Appendix C. ■ 

Lemma 36: Suppose X = r mod A:>0. Iff>r + A + 1 and m > 2r — A + 1, then there exists a mesh of [n]. 

Proof: The proof is given in Appendix D. ■ 

Now, we have the following construction. 

Construction 2: Let TZL)B be a mesh of [n], where TZ = {RLi, • ■ • , RLm} is the set of red lines and B = {BLi, • • • , BLi} 
is the set of blue lines. Let Xi, - ■ ■ ,Xk be k information symbols. Then we can construct an [n = k + m + i, k] systematic 
linear code C over F2 such that the parities are and are computed as follows; 

• For each i € [m], 


Xk+i — ^ ^ Xj . 

j£RLi\{k+i} 


(31) 


• For each z S [£], 


^k+m+i — ^ ^ ^j- (32) 

Note that by condition (i) of Definition!^ for each i G [m], we have RLi\{k + z} C [k]. So by OTT i. Xk+i is computable 
from information symbols. Similarly, for each i G [t], by condition (ii) of Definition U7\ BLi\{k + m + z} C [fc + m]. So by 
(O, Xk+m+i is computable from {xj]j G [k + m]}. Hence, Construction 2 is reasonable. 

Theorem 37: The code C obtained by Construction 2 is an {n = k + m + i,k,r, 3)-ELRC. 

Proof: Let S = [k + m] and T = {fc + m + 1, • • ■ , n}. Then S' n T = 0. 

For each i G S, hy conditions (iii) and (iv) of Definition there exists a red line L G TZ and a line L' G TZU B such 
that L n L' = {z}. By conditions (i), (ii) of Definition [32\ |L\{z}| = r and |L'\{z}| < r. So by OTI) and (l3^ . Ri = T\{z} 
and i?2 = two disjoint (r, C)-repair sets of z. Moreover, for each j G L\{z}, by condition (i) of Definition 

j G L (G [k + m]. Then by condition (iii) of Definition!^ there exists an L" gTZUB such that L" f L and j G L". Clearly, 
L" f L'. (Otherwise, {i,j} C L n L' = L fl L", which contradicts to condition (iv) of Definition!^) So by condition (v) of 
Definition!^ L" H L' = 0. Let R = L''\{j}. Then R fl (i?2 U {z}) ^ L” L' = % and by dSTb , ( |32] |. R is an (r, C)-repair set 
of j. 
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For each i S T, let i' = i — {k + m). Then i' € [P\. Let R = BLi\{i\. Then by condition (ii) 
i? C [fc + m] = .S' is an (r, C)-repair sets of i. 

By Lemma l26l C is an (n, fe, r, 3)-ELRC. 

r 2fc+ f —1 ' 

Note that the code C obtained by Construction 2 has length n = k + m + i = k+ —, 
So the following theorem is a direct consequence of Lemma [T5l [^and Theorem [TTl 
Theorem 38: Suppose one of the following conditions hold: 

(i) r\k and m > r. 

(ii) ^ > r + A + 1 and m > 2r — A + 1, where A = r mod fc > 0. 


of Definition and by 


which meets the bound 


(O, 

■ 

dni). 


Then there exist (n, k, r, 3)-ELRC over the binary field that meet the bound (fTSl) . 

Binary codes with all-symbol locality r, availability t and code rate are constructed in ina for any positive integers r 


and t (such codes are a subclass of (n, fc, r, f)-ELRC). Eor < = 3, the code length is n = fc^^ = fc + ^ > fc + 
Hence is not optimal according to the bound (fTSl l. 


2fc-Kf1 


VI. Conclusions 

We investigate the problem of coding for distributed storage system that can locally repair up to t failed nodes, where f is a 
given positive integer. Given the code dimension fc, the repair locality r and t € {2,3}, we derive a lower bound on the code 
length n under the functional repair model. We also give some constructions of exact LRCs for t € {2, 3} with binary field 
and whose length n achieves the corresponding bounds, which proves the tightness of our bounds and also implies that there 
is no gap between the optimal code length of functional LRCs and exact LRCs for certain sets of parameters. 

Some problems are still open. Eor example, what is the optimal code length for f > 4? Given n, fc, r and t, what is the upper 
bound of the minimum distance dl Another interesting problem is to construct functional locally repairable codes {C\; A G A} 
with small size of A. 


Appendix A 
Proof of Claim 1 

To prove Claim 1, the key is to prove the following two statements: a) Eor each v € B U Ci, |£’biue(t^)| > 1; b) Each blue 
edge belongs to at most r different v € B U Ci. 

Eor each v G B,hy (fTSl l. |Out(u)| = 2. So we can assume Out(u) = {vi,V 2 ]- Then vi,V 2 are two inner nodes of By 
1) of Corollary [Tbl Out(z;i) 7^ 0 or Out(?;2) 7^ 0 . Without loss of generality, we can assume Out(z;i) 7^ 0 and 1:3 G Out(?;i). 
Then we have the following two cases: 

Case 1: (vi,V3) is not a green edge. Since vi is an inner node, then (vi,V3) is not a red edge. Note that v G B and 
vi G Out(u). Then (vijVs) is a blue edge belonging to v. 

Case 2: (uijUs) is a green edge. Then {ui} = Out(tt) for some tt G Ci U € 2 - By 2) of Corollary [16] Out(ii2) 7^ 0 . Let 
U4 G Out(w2). Since V 2 is an inner node, then {v 2 ,Vi) is not a red edge. Not that by 3) of Corollary [TSI |Out(ri;)| > 2 for 
any source w G In(?;2). (As illustrated in Eig. (EJa).) Then (Ci U C 2 ) Cl In(w2) = 0 , which implies that V 2 ^ Out(?7i) for any 
m G Cl U C2. So {v 2 ,va) is not a green edge. Since v G B and V 2 G Out(u), then {v 2 ,va) is a blue edge belonging to v. 




Fig 12. Illustration of the local graph in the proof of Claim 1. 

In both cases, we can find a blue edge belonging to v. 

Eor each v G C\, by (|T9j), |Out(r;)| = |Out^(i;)| = 1. We can assume Out(i;) = {ui} and Out^(i;) = {^2}. Then vi,V 2 
are two inner nodes. By 2) of Corollary [T3I we have Out^(ti) = Out(tii) = {1^2}- Eurther, by 3) of Corollary [T3I we have 
Out(ti2) 7^ 0. Let V 3 G Out(?;2). Since V 2 is an inner node, the edge (^2,1^3) is not a red edge. Not that by 4) of Corollary [T3I 
|Out(u)| > 2 for any source u G In(w2). (As illustrated in Eig. fT^ b').') Then we have (Ci U C2) H In(?;2) = 0, which implies 
that V 2 ^ Out(it) for any u G Ci U C2. So {v 2 ,V 3 ) is not a green edge. Note that v G Ci and Out^(u) = Out(?;i) = {^2}. So 
{v 2 ,V 3 ) is a blue edge belonging to v. 
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By the above discussion, we proved that |£’biue('u)| > 1 for each v € BU Ci, which proves the statement a). 

Let {u',u") be a blue edge and S be the set of all u S i? U Ci such that {u',u") belongs to v. For each u G S', we pick a 
ipiv) G In(u') depending on the following two cases; 

Case 1: V € B. Since {u',u”) is a blue edge belongs to v, then u' G Out('(;), which implies v G In(M'). Pick ip{v) = v. 
Case 2: u G Ci. By ( fT^ . |Out^(u)| = |Out(?;)| = 1. Denote Out(u) = {u'}. Then by 2) of Corollary [T3l Out^(u) = Out(u'). 
Moreover, since {u',u") is a blue edge belongs to v, then u' G Out^(u) = Out(u'). So v' G In(M'). Pick (f{v) = v'. 

If V and w are two different sources in S fl Ci, by 5) of Corollary [T^ their out-neighbors are different. So (p{v) ^ ‘fiiw). 
Thus, is a one-to-one correspondence between S and a subset of In(u'). Note that |In(rt')| < r. So |S| < |In(M')| < r. Thus, 
u") belongs to at most r different v G B U Ci, which proves the statement b). 

By statements a) and b), we have |fbiue| > which proves Claim 1. 


Appendix B 
Proof of Lemma |27] 

We need to consider two cases, i.e., r\k and r \ k. 

Case 1: r\k. We can let k = mr. Then rj = = 2m and m = [^J. By assumption of Lemma IZ7l m = [^J > r. We 

assign the elements of [fc] in a r x m array D = (uij)ie[r],je[m] in Fig- [l3]such that [fc] = {aij-,i G [r],j G [m]}. For 
each j G [m], let Aj = G [r]}. Then \Ai\ = r,Vi G [m]. In Fig. [13] each subset Ai is represented by a red line. 


I7l,l 

1 

ai,2 • ■ 

1 

1 

172,1 

! 

02,2 • ■ 

1 

! 

i 

®r,2 • • 

i 

• ^r,m 


Fig 13. Partition of [n]: Each subset is represented by a red line. 


Let 5 = r and C = {Ai, ■ ■ ■ , Am}- Then \Ai\ = S for each i G [m]. Let = r^Mi G [m\. Then I- 

Since m > r = rj, Vi G [m], then by Corollary l24l there exists a collection {i?i, • • ■ , of subsets of UJLi = W\ 
satisfies the following three properties: 


• i?i, • • • , Bm are pairwise disjoint and IJ™ Bi = IJ^i 

• \Bi\ = ri = r for all i G [m]; 

• \Bi n Ajj < 1 for all i,j G [m]. 

For each i G [m], let Am+i = Bi. Then it is easy to check that A = {Ai, ■ ■ ■ , satisfies conditions (i)—(iii) of Lemma 
[27] where rj = \^~\ = 2m. 

Case 2; r \ k. Let m = [^]. Since r } k, then m — 1 = and k = (m — l)r -f A, where 0 < A < r. By assumption of 
Lemma l27l we have 



Let a = m — 1 — (r — A). We can assign elements of [fc] in an r x m array D = {aij)i(=[r]je[m] in Fig. [T4| such that 
{a^J■,i G [r],j G [m - 1]} U {a^^rn'A G [A]} = [k] and = 0, Vi G {A -f 1, • • • , r}. Let 


Ao = {ai.j-J G {a -f 1, • • • ,m - 1}}. 


Then |Ao| = (m — 1) — a = r — A. Let 


— 


{a,j;iG[r]}, ifjG[m-l]; 

{ai.m;i G [A]} UAo, if j = m. 


In Fig. [T4| each subset in {Ai, • • ■ , Am-i} is represented by a red solid line and Am is represented by a red dashed line. For 
convenience, we call each subset in {Ai, • • • , Am} a red line. Clearly, \Aj \ = r and \Aj fl Aji \ < 1 for all j ^ j' G [m]. 

For each j G [m], let Cj = Aj\Ao. Then Ci,--- ,Cm are pairwise disjoint and = [fc]\Ao. So |lJ^iQ)l = 

|[A:]\Ao| = k — r + X. Moreover, we have 

{ r, ifjG[a]; 

r-1, if j G {a-f 1,-• • ,m - 1}; 

A, if j = m. 
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Fig 14. 


Ul,l 

1 

• • di^a 

1 

dl,a+l - 

1 


- fll, 

1 

1 

02,1 

! 

1 

• 02,a 

! 

1 

02,a+l 
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• 02,m-l 

! 

^2, 

1 

1 

i 

0>,,1 

1 

i 

i 

OA,a+l 

1 

i 

• • — l 

1 

o-x, 

1 

oa+1,1 

! 

• • ^A+1,q 

! 

1 

OA+1,q + 1 

! 

• OA+l,m-l 

! 

0 

i 

dr A 

i 

. . dr^a 

i 

<^r,Q+l 

i 

• • —1 

0 


Construction of subsets: Each of the first m — 1 subsets is represented by a red solid line and the mth subset is represented by a red dashed line. 


Let p = \. Then k — r + \ can be represented as the sum of p positive integers (not necessarily different) ri, • • • , rp 
such that ri < r,\/i € [p]. Since m — 1 > r, using the Gale-Ryser Theorem, we can construct an m x p binary matrix 
M such that for each i € [p] and each j £ [m], the sum of the ith row is and the sum of the jth column is \Cj\. Let 
C = {Cl, • • ■ , Cm}. By Lemma l22l there exists a collection {Bi, • • • , Bp} of subsets of IJ^i — [fc]\^o such that 

• Bi, ■ ■ ■ , Bp are pairwise disjoint and IJf^i Bi = Ujli — [^]\^o; 

. \Bi \ = n for all i G [p]; 

• \Bi C\Cj\ < 1 for all i £ [p] and j £ [m]. 

Now, for each i £ [p], let A^+i = Bi. Note that k = {m — l)r + A and p = . Then to + p = to + = 

I" mr+k-r+x ^ _ = 7 ^ Thus, we obtain a collection A = {Ai, • • • ,Ap} of rj subsets of [/c]. For convenience, we call each 

subset in {Am+i, ■ ■ ■ , Ar^} a blue line. 

By the construction, we have \Ai\ < r for each i £ [p]. So condition (i) of Lemma [27] is satisfied. 

Again by the construction, we have the following observations; 1) Each i G Aq belongs to exactly two red lines and each 

i £ [fc]\Ao belongs to one red line and one blue line; 2) Any two different red lines has at most one point (element) in 

common; 3) Any two different blue lines have no point (element) in common; 4) A red line and a blue line have at most one 
point (element) in common. 

Observation 1) implies that each i £ [k] belongs to exactly two subsets in A. So condition (iii) of Lemma l27l is satisfied. 
Moreover, observations 2)—4) imply that any two different lines have at most one point (element) in common. So condition 
(ii) of Lemma |22] is satisfied. 

Thus, we can always construct a collection of p = \^~\ subsets of [fc] that satisfies conditions (i)—(iii) of Lemma l27l 

Appendix C 
Proof of Lemma [35] 

We will construct a set 7^ = {RLi, ■ ■ ■ , RLm} of red lines and a set B = {BLi, • • • , BL/,} of blue lines and prove that 
TZU B is a mesh of [n]. 

Since to = [^] and by assumption of Lemma r\k, then k = mr and k + m = {r + 1)to. We can assign the elements 
of [k + to] in an (r + 1) x to array D = (aij)igp+i],je[m] in Fig. [15] such that [fc] = {aij;i £ [r],j £ [to]} and 
Qr+ij = k + j, Vj £ [to]. For each j £ [to], we let RLj = {aij;i £ [r + 1]}. In Fig. [15] each subset in {RLi, • • • , RLm} 
is represented by a red solid line. 

0 ^ 1,1 O '!,2 

I I 

^ 2,1 0 ^ 2,2 


CtrA Qr,2 

I I 

fc + 1 k + 2 

Fig 15. ConstiTiction of red lines: Each red line is a column of the aiTay. 

Since k = mr, then i = — rri = . Hence, k + m can be represented as the sum of i positive integers 

ri, • • • ,re such that Vi < r for each i £ [P\. Let C = {RLi, • • • , RLm] and S = r + 1. Note that by assumption of Lemma 
Its] to > r. So we have Vi < r < m for each * £ [£]. By Corollary [24] there exists a collection (Hi, • • • ,Bi} of subsets of 
U^i '^hat satisfies the following properties: 


I 

^2,m 


(^r,m 

I 

k + m 
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• Bi, • • • , are pairwise disjoint and lJi=i = U^i = [^ + 

• \Bi\ = Ti for all i G [£]; 

• I Si n I < 1 for all i G [^] and j G [to] . 

For each i G [-^j, let BLi = Bi U {k + m + i}, and let B = {BLi, • • • , BL^}. 

By the construction, it is easy to check that conditions (i), (ii), (iv) of Definition are satisfied. 

By the construction, we also have the following observations: 1) 72. is a partition of [fc + to]; 2) is a partition of [n]; 3) 
\BLi n RLj\ < 1 for all i G [t] and j G [to]. 

By the above observations, we can easily check that conditions (iii), (v) of Definition are satisfied. 

So 72 U B is a mesh of [n]. 


Appendix D 
Proof of Lemma[36] 

We will construct a set 72 = {777^1, • • • , RLm] of red lines and a set B = {77Li, • • • , BLi} of blue lines and prove that 
72 U B is a mesh of [n]. 

Since to = [^] and A = r mod fc > 0, then 

fc = (TO—l)r + A. (33) 

Hence, k + m = (to — l)r + A + to = (to — l)(r + 1) + (A + 1). We can assign the elements of [k + to] in an (r + 1) x to 
array D = (oijjigp+i]jg[m+i] as in Fig. [Tbl such that [k + m] = {oij; i G [r + 1], j G [to - 1]} U G [A + 1]} and 

ai,m+i = 0 for j G {A + 2, • • • , r + 1}. Moreover, by proper permutation (if necessary), we can let 0^+1,^ = k + j for each 
j G [to — 1] and oa+i,™ = k + m. We can construct 72 = {7?Li, • • • , RLm} and B = {77Li, • • • , BLm+\} by the following 
three steps. 
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• - Ctl,Tn-l-- 

1 


1 

0'2,1 

] 

1 

Cl2,a 

! 

1 

0^2,a+l 

] 

1 

0^2,m~l 

] 

^2,m 

1 

1 

i 

^A.l 
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i 
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1 
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1 

“A+1,1 • 
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1 
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1 

1 

• ^A+l,Tn —1 

1 
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1 

aA+2,1 • 

] 

1 

• (^X+2,a 

1 

1 

a'A+2,Q:+l 

1 

1 

• ^A+2,Tn —1 

1 

0 

i 

dr.l 

1 

i 

• ^r,ct 

1 

i 

dr^a+l 

1 

i 

1 

0 

1 

k 1 

1 

• k + a 

1 

A: + a + 1 

1 

k m — 1 
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Fig 16. Construction of red lines of [n]: The first m — 1 red lines are the first m — 1 columns of the array and the last red line is depicted by a dashed 
red line, where a = m — 1 — (r — A). 

Step 1: Construct 72 = {7?Li, • • • , RLm}- 
Denote 


q; = to — 1 — (r — A) 


(34) 


and for each 7 G [r + 1], let 


Ai = {aij;j G {a+ 1, • • • ,to - 1}}. 


Then we have \Ai\ = m — 1 — a = 
For each j G [to], let 


r — X,\/i € [r + 1]. 


RLj — 


{aij;i G [r + 1]}, 

{o-i,m', * G [A + 1]} U Ai, 


if j G [to - 1]; 

if j = TO + 1. 


In Fig. [T^ each subset in {7?Li, • • • , RLm-i} is represented by a red solid line and RLm is represented by a red dashed line. 
Clearly, |7?Li| = r + 1 for all z G [to — 1]. Moreover, by the construction, \RLm\ = \Ai\ + A + 1 = (r — A) + (A + 1) = r + 1. 
So we have |7?Li| = r + 1 for all i G [m]. 

Step 2: Partition RLi. 

By assumption of this lemma, to > 2r — A + 1, which implies that to — 1 — (r — A) > r. So by ( l34l i. we have 


a = TO — 1 — (r — A) > r. 
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Let 


= a{r + 1) — (A + l)(r — 1) — rA 


and 

h = e-{X + l)-r. 

By assumption of this lemma, £ > A + 1 + r. So we have h > 0. Moreover, note that 


r/31 


a(r + 1) — rA — (A + l)(r — 1) 


r 


r 




(m — 1 — r + A)(r + 1) — rA — 

A + l)(r-l) 


2[(to — l)r + A] + TO 


— TO — (A + 1) — r 


2 k + m 


— TO — (A + 1) — ; 


= £- (A + l)-r 
= h. 


(35) 

(36) 


So [3 can be represented as the sum of h positive integers, say ri, • • ■ ,rh, such that < r, Vi £ [h]. Moreover, we let 


Then by dTSl ) and ( |36] |. we have 


n = 


T — 1, ifi^l/i+l,*-* ,/i + A-fl}; 
A, if i e {/i +A + 2,---,f}. 


t h h+A+1 t 

^n = ^n+ ^ r,+ ^ n 

i—1 2=1 i—h-\-X-\-2 

= /3 + (A + l)(r -l) + {£-h-X-l)X 
= P + {X + l)(r - 1 ) + rA 
= a{r + 1) 

OL 

= \Jrl, . 

i=l 


Let C = {RLi, • • • , RLa} and 5 = r + 1- Note that < r < a = |£|, Vi £ [£]. Then by Corollary |24l there exists a 
collection {Si, • • • ,Bf} of subsets of Uti satishes the following three properties: 

• Bi, ■ ■ ■ , Bg are pairwise disjoint and lJi=i Ri = Uti RLi'^ 

• |Si| = n for all i £ [£]■, 

• I Si n RLj I < 1 for all i £ [£] and j £ [a]. 

Step 3: For each i £ [£], extend Bi to BLi. 

For each i £ [h], let 

BLi = Si U {/c + TO + i}; 


For each i £ {/i + 1, • • • , h + A + 1}, let 

BLi — Si U {ui_ii ^ 771 - 1 - 1 , k T 77Z T 2}, 

For each i £ {/i + A + 2, • • • , £}, let 

BLi = Si U Ai-h-\ U (A: + TO + i}. 

Note that by ( l36b . we have £ — h — X = r+1. So for each i £ {h + X + 2, ■ ■ ■ ,£}, we have i — h — X G {2, • • • , r + 1}. 
Hence, BLi is reasonably constructed and Ai fl BLi = 0- 

By the construction, it is easy to see that conditions (i), (ii) of Dehnition are satished. Moreover, we can see that each 
point in Ai belongs to two red lines and each point in [k + to]\Ai belongs to a red line and a blue line. So condition (iii) of 
Dehnition is satished. 

By the construction, we also have the following observations: 1) ISL771 fl RLi\ = 0 for i £ [a]; 2) ISL771 fl RLj\ = 1 for 
j £ {a + 1, • • • , TO — 1}; 3) If i,j £ [to — 1] and i ^ j, then RLi and RLj have no point in common; 4) A red line and a 
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blue line have at most one point in common; 5) Two different blue lines have no point in common; 6) If a blue line intersects 
with RLm, then it does not intersect with RLi for alHsla + l,-- - ,m — 1}. 

Note that observations 1)—3) imply that any two different red lines have at most one point in common. Hence observations 
1)—5) imply that condition (iv) of Definition [3^ is satisfied. Now suppose that two lines, say Li and L 2 , intersect with RLi 
for some i G [m]. We have the following three cases: 

Case \-. i G [a]. Then by observations 1) and 3), Li and L 2 are two different blue lines. So by observation 5), Li and L 2 
have no point in common. 

Case 2: i G {a + 1, ■ ■ ■ ,m — 1}. Then by observations 2) and 3), we have the following two subcases. 

Case 2.1: Li is RLm and L 2 is a blue line. By observation 6), Li and L 2 have no point in common. 

Case 2.2: Li and L 2 are two different blue lines. Then by observation 5), Li and L 2 have no point in common. 

Case 3: i = m. Then by observations 1) and 2), we have the following three subcases. 

Case 3.1: Li is RLi for some i G {a + 1, - ■ ■ ,m — 1} and L 2 is a blue line. By observation 6), Li and L 2 have no point 
in common. 

Case 3.2: Li = RLi and L 2 = RLj for some i,j G [to — 1] and i ^ j. By observation 3), Li and L 2 have no point in 
common. 

Case 3.3: Li and L 2 are two different blue lines. Then by observation 5), Li and L 2 have no point in common. 

By above discussion, we proved that condition (v) of Definition is satisfied. 

So 7?. U 6 is a mesh of [n]. 
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